DISPAQ: Distributed Profitable-Area Query from Big Taxi Trip Data †
نویسندگان
چکیده
One of the crucial problems for taxi drivers is to efficiently locate passengers in order to increase profits. The rapid advancement and ubiquitous penetration of Internet of Things (IoT) technology into transportation industries enables us to provide taxi drivers with locations that have more potential passengers (more profitable areas) by analyzing and querying taxi trip data. In this paper, we propose a query processing system, called Distributed Profitable-Area Query (DISPAQ) which efficiently identifies profitable areas by exploiting the Apache Software Foundation's Spark framework and a MongoDB database. DISPAQ first maintains a profitable-area query index (PQ-index) by extracting area summaries and route summaries from raw taxi trip data. It then identifies candidate profitable areas by searching the PQ-index during query processing. Then, it exploits a Z-Skyline algorithm, which is an extension of skyline processing with a Z-order space filling curve, to quickly refine the candidate profitable areas. To improve the performance of distributed query processing, we also propose local Z-Skyline optimization, which reduces the number of dominant tests by distributing killer profitable areas to each cluster node. Through extensive evaluation with real datasets, we demonstrate that our DISPAQ system provides a scalable and efficient solution for processing profitable-area queries from huge amounts of big taxi trip data.
منابع مشابه
Revealing daily travel patterns and city structure with taxi trip data
Detecting regional spatial structures based on spatial interactions is crucial in applications ranging from urban planning to traffic control. In the big data era, various movement trajectories are available for studying spatial structures. This research uses large scale Shanghai taxi trip data extracted from GPS-enabled taxi trajectories to reveal traffic flow patterns and urban structure of t...
متن کاملImproving Viability of Electric Taxis by Taxi Service Strategy Optimization: A Big Data Analysis of New York City
Electrification of transportation is critical for a lowcarbon society. In particular, public vehicles (e.g., taxis) provide a crucial opportunity for electrification. Despite the benefits of eco-friendliness and energy efficiency, adoption of electric taxis faces several obstacles, including constrained driving range, long recharging duration, limited charging stations and low gas price, all of...
متن کاملMeasuring the Efficiency of Urban Taxi Service System
The taxi service systems in big cities are immensely complex due to the interaction and self-organization between taxi drivers and passengers. An inefficient taxi service system leads to more empty trips for drivers and longer waiting time for passengers, and introduces unnecessary congestion to road network. Although understanding the performance of urban taxi service system is important, the ...
متن کاملOpenStreetCab: Exploiting Taxi Mobility Patterns in New York City to Reduce Commuter Costs
The rise of Uber as the global alternative taxi operator has attracted a lot of interest recently. Aside from the media headlines which discuss the new phenomenon, e.g. on how it has disrupted the traditional transportation industry, policy makers, economists, citizens and scientists have engaged in a discussion that is centred around the means to integrate the new generation of the sharing eco...
متن کاملHigh-Performance Spatial Join Processing on GPGPUs with Applications to Large-Scale Taxi Trip Data
Spatially joining GPS recorded locations with infrastructure data, such as points of interests, road network, land cover and different types of zones, and assigning a point with its nearest polyline or polygon is a prerequisite for trip related analysis, which is becoming increasingly important in ubiquitous computing. However, existing spatial databases and GIS are incapable of handling large-...
متن کامل